http://www.ebi.ac.uk/c9i-bin/cxpasyfetch7U79117 

ID U79117 Standard; DNA; PRO; 3573 BP. 

XX 

AC U79117; 
XX 

SV U79117.1 

XX 

DT 04-SEP-1998 (Rel . 56, Created) 

DT 04-SEP-1998 (Rel. 56, Last updated, Version 1) 
XX 

DE Clostridium thermocellum S-layer protein (slpA) gene, complete cds. 

XX 

KW 

XX 

OS Clostridium thermocellum 

OC Bacteria; Firmicutes; Clostridia; Clostridiales; Clostridiaceae; 
OC Clostridium. 

XX 

RN [1] 
RP 1-3573 

RX MEDLINE; 98129094 . 

RA Lemaire M. , Miras I., Gounon P., Beguin P.; 

RT "Identification of a region responsible for binding to the cell wall within 
RT the S-layer protein of Clostridium thermocellum"; 
RL Microbiology 144 (Pt) : 0-0 (0) . 
XX 

RN [2] 
RP 1-3573 

RA Miras I., Beguin P.; 
RT 

RL Submitted (20 -NOV-1996) to the EMBL/GenBank/DDBJ databases. 

RL Biotechnology, Institut Pasteur, 28, rue du Dr. Roux, Paris 75724 Cedex 15, 

RL France 

XX 

DR SPTREMBL; 086999 ; 086999. 
XX 

FH Key Location/Qualifiers 
FH 

FT source l. .3573 

FT /db_xref " t axon : 1515 " 

FT /mol_type= "genomic DNA" 

FT /organism= "Clostridium thermocellum" 

FT /strain="NCIB 10682" 

FT CDS 145. .3255 

FT /codon_start=l 

FT /db_xref=" SPTREMBL :086999" 

FT /transl_table=ll 

FT /gene="slpA" 

FT /product= "S-layer protein" 

FT /protein_id= " AAC33404.1 " 

/translation="MKNLKKVLAVLWISVISTLLVPAFADSFSYEKEAEILYRLGLYK 
GTSETEYVPNLEGKLDRQTGWMLLRLFGQEDDALEIPMDEAAQTLAAKFKDAADIADW 

FT AQRQVAYAVEKGYVKGYPDGTFLPNADLNGLAFCSLILQQLGYDGDFVFDEAAYKLQEF 

^'^ GGLTAEQAEAFNNKNGINRDSMVGIAFSALQAVYKATGKTVIEVLVENGNVSKELAIEL 

FT GVLLKAIKEVKALDAVKVQVGKEPVLPEEVEWYEDDTTEKLAVEWPTVDTSEVGEQEI 

EGTIKGASGLAYREPKATLKVIVTPEELQWDVKAPNLKEIVIEFNGEVASKADEKSSY 

FT SVEDNTIELVTVSEDKTTVTLTVAGAMTAEEEIEVTIKTATGLKEEVTKTWPADYENP 

EAESIALIGPNSFEIKFSEPVQSSSDAEVLVNDGTYYVSEEKLSQDYRTLTVELGVSSL 

FT NEGTYKVKVKGYRDYAGNIMRTKTFDLEYVKDTTPPTAKVKEATQNKWIEFNEPATRD 

GYSGDEAALTRDYFYQTYSSWKPTKWASDNNKVYTLYFSEDQNDGGYPVYLLPVGNVT 
ITILKEVDDDAWDAWGNKLESDLKLTATVAADNEAPTVKSVTAEAEDKIVWFSEDVN 
ENQAKDKDNYVIKKDGKEIDTAISSITYDSNETKVTIVLDEKLSGGKYTIDIKGIKDTS 
VSENEMKAVTIEFEVTDKTAPTIEEVTFVDNYIYVRYSEAMSTKGNGSVLNKDNYKLVD 
DNDKKVEIKKIELFGSDKNKVRITVDSDVDLNVDYELTIANVEDEAGNAISAFDVKAKK 
LSEEQAPEVSEIRIISKTEIEIVINKILDKATVEKTDFEVERGSNKVALTRISSITYDD 
GKTIVKGVLPDAVRPANSGDITGYTLYIVGEIKSDTGKEMATGAVSKPVDDKFAPSFVS 
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FT VANGVYGDASKKGFTLTFDEDIKFLNNSAGLGATDLVIKNGSKTLEAGIDYDVAAIDNK 

FT ITVTLKGDDYADFTGTLKVSTKDTVKYITDEAGNALNKFENKEVKVQ" 

XX 

SQ Sequence 3573 BP; 1290 A; 601 C; 792 G; 890 T; 0 other; 

gcttgaattc aagataggtt gtgtgcgcca gtcttgaaga aaaagcaaga taggtcaggt 60 

taaaaaacaa aagggataag gcgtagcctg gctttaattt caagcctgct tcccaaaaaa 120 

aacatacaag gaggatttga gagtatgaag aatctcaaaa aagtgctggc agtgttagtt 180 

gtaatctcag tgatttcaac tctcttagtg cctgcatttg ctgattcatt cagctatgaa 240 

aaagaagcag aaattttgta taggctcggc ttatacaaag gaacatcaga aacagagtat 300 

gttccaaact tggaaggtaa acttgacaga cagaccggag tagttatgct cctcagactg 360 

ttcggtcagg aagacgatgc attggaaatt ccaatggatg aggcagctca gacacttgct 420 

gctaaattca aagatgcagc tgacattgca gactgggcac aaagacaagt ggcttatgca 4 80 

gttgaaaagg gatatgtaaa aggttatccg gacggaacat tccttccgaa cgcagacctc 540 

aacggcttgg ctttctgctc attgattctt cagcagttgg gatatgacgg agactttgtt 600 

ttcgatgaag ctgcgtacaa gttgcaagag tttggcggct tgactgcaga acaagctgaa 660 

gcgttcaaca acaagaacgg aatcaacaga gactcaatgg ttggtattgc tttctcagct 720 

ttgcaggctg tatacaaagc tacaggaaag acagttattg aggttctcgt agagaacgga 7 80 

aatgtttcca aagaacttgc tatagaactc ggtgttcttt tgaaagccat caaggaagta 84 0 

aaagctttgg atgctgttaa ggttcaggtt ggaaaagaac ctgtacttcc tgaagaagta 900 

gaagtagtat atgaagatga cacaactgaa aaacttgcag ttgaatggcc tacagttgat 960 

acttcagaag ttggtgaaca ggaaatcgaa ggtactatca aaggtgccag cggtttggct 1020 

tacagagaac caaaggctac tctcaaggtt atagtaacac ctgaagaact ccaagttgta 1080 

gatgttaagg ctcctaacct taaggaaatt gtaattgaat tcaacggaga agtagcttca 1140 

aaagctgatg aaaaatccag ctactcagtt gaagacaata ctattgaatt ggttacagta 1200 

tcagaagaca agactacagt tacattgaca gttgctggtg ctatgacagc tgaagaagaa 1260 

atcgaagtaa caattaaaac agcaactggc ttgaaggaag aagttactaa gactgtagta 1320 

cctgctgact acgaaaatcc ggaagctgaa tccattgctt tgataggtcc gaactccttt 1380 

gaaattaaat tctcagaacc tgttcagagc agctcagatg cagaagttct cgttaatgac 1440 

ggaacttatt atgtaagtga agaaaaactg tcacaggact acagaacatt aactgtagaa 1500 

ctgggcgtaa gttcattgaa tgaaggaact tacaaagtaa aagttaaagg ttacagagac 1560 

tatgctggaa acataatgag aacaaagacc tttgacttgg agtatgtaaa agatacaact 162 0 

cctccaactg ctaaagtaaa agaagcaaca cagaacaaag tagtaattga attcaatgag 1680 

cctgctacaa gagatggtta ctctggtgat gaagcagctc ttacaagaga ttacttctat 1740 

cagacatatt cttcctggaa gccaactaaa gttgtagctt cagacaataa caaagtttat 1800 

actttgtact tctctgaaga ccagaacgat ggtggctatc ctgtatatct gcttccggta 1860 

ggaaacgtta ctataacaat cctcaaggaa gtagatgatg acgctgtagt agatgcatgg 192 0 

ggcaacaagc tcgagtccga tcttaaactt actgctacag tagcagccga taatgaggct 1980 

ccaacagtaa aaagtgtaac agctgaagca gaagataaaa tcgtggttgt attcagcgaa 2040 

gatgtaaacg agaaccaggc aaaagataag gacaactatg taattaagaa agacggaaaa 2100 

gaaatagaca cagctatttc aagcatcaca tatgacagca atgaaacaaa ggtaacaatt 2160 

gttttggatg aaaagcttag tggtggaaaa tatacaatag atattaaggg tattaaagat 2220 

acttcagtat ctgaaaacga aatgaaagca gttactattg aatttgaagt aactgacaag 2280 

acagctccga caattgaaga agtaacattt gttgacaact acatctatgt aagatacagc 2340 

gaagctatgt caacaaaggg caacggttca gtactgaaca aggacaacta caaactcgta 2400 

gatgacaacg ataagaaagt agaaattaag aaaattgaat tgtttggctc tgacaagaac 2460 

aaagtaagaa taactgttga tagtgatgta gatcttaacg tagattacga actcacaatt 2 52 0 

gctaacgttg aggatgaagc tggtaatgct ataagtgcat tcgatgttaa ggcaaagaaa 2580 

ttgagtgaag agcaagcacc agaagtatca gaaattagaa ttatcagcaa gactgaaatt 2640 

gaaatagtaa ttaacaagat ccttgacaag gcaactgttg aaaagactga ctttgaagta 2700 

gaaagaggca gcaacaaagt tgcactcaca agaataagct caatcaccta tgatgatggt 2760 

aaaacaatag ttaagggtgt actcccggat gcagtacgtc ctgctaactc cggagacatc 2820 

acaggttata cgctctacat tgtgggtgaa attaaatccg atacaggtaa ggaaatggca 2880 

acaggagcag tttcgaagcc agttgatgat aagtttgcac caagctttgt aagcgtagct 294 0 

aatggtgtat acggcgatgc atcaaagaaa ggattcacat tgacattcga tgaagacatc 3000 

aagttcttga acaactcagc tggtttgggt gcaaccgacc tcgtaatcaa gaacggtagc 3060 

aagactcttg aagctggtat cgactatgat gtagcagcta tcgataacaa gataacagtt 312 0 

acactcaaag gtgacgacta tgctgacttc acaggaactc ttaaagtttc aactaaggat 3180 

actgtgaagt atatcacaga cgaggctggt aatgcactca acaagttcga aaataaagag 3240 

gtaaaggttc aataatctta cttcggataa aaaggcttcc caaatgggaa gccttttttc 3300 

atacaaaaaa gcctccttat atttctaatt gacatcaaaa cattaattat atacaataat 3360 

aatgatttaa agttaacaaa taggcaattc aaaaggagta ctgcacaatg gcattactcc 342 0 

atcattattg gattcccgcc taaaaagaga ctatggcttt atatcctgtg agataaacgc 34 80 

agatgggggc agaatagatg gaagagataa gcttgagaga attaattgaa ataatcatta 3540 

aaggtaaatg gattatagta gctgtaactg cag 3573 
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Entry information 


Entry name 


086999 


Primary accession number 


086999 


Secondary accession numbers 


None 


Entered in TrEMBL in 


Release 08, November 1998 


Sequence was last modified in 


Release 08, November 1998 


Annotations were last modified in 


Release 24, June 2003 


Name and origin of the protein 


Protein name 


S- layer protein 


Synonyms 


None 


Gene name 


SLPA 


From 


Clostridium thermocellum [TaxID: 15151 


Taxonomy 


Bacteria: Firmicutes: Clostridia: 




Clostridiales: Clostridiaceae: Clostridium. 
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References 



[1] SEQUENCE FROM NUCLEIC ACID. 
STRAIN=NCIB 10682; 

MEDLINE=98129094; PubMed=9467913; [NCBI, ExPASy . EBI . Israel . 
Japan 1 



Lemaire M. . Nilras I. . Gounon P. . Beguin P. : 



"Identification of a region responsible for binding to the cell wall within 
the S-layer protein of Clostridium thermocellum."; 

Microbiology 144^0-Oroy 



Comments 


None 


[Cross-references 


EMBL 


U79117; AAC33404.1; FEMBL / SenBank / DDBJl 

rCobinqSequence] 


PIR 


T30311; T30311. 


InterPro 


IPR001119: SLH. 

Graph leal view of domain structure. 


Pfam 


PF00395; SLH; 1. 


ProDom 


rbomaln structure / List of sea. sharing at least 1 domain] 


HOBACGEN 


[Family / Alignment / Treel 


ProtoMap 


086999. 


PRESAGE 


086999. 


ModBase 


086999. 


SWISS-2DPAeE 


Get region on 2D PAGE. 


Keywords 





None 



Features 



None 



Sequence information 


Length: 
1036 AA 


Molecular weight: 
113330 Da 


CRC64: 1AF2202060D7C2A1 [This is a 
checksum on the sequence] 
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NiccProt Vlevw^f TrEMBL 086999 httpi//us.cxpasy,or9/c9i-bln/niceprot.pl?086999 
10 20 30 40 50 60 

I I I I I I 

MKNLKKVLAV LWISVISTL LVPAFADSFS YEKEAEILYR LGLYKGTSET EYVPNLEGKL 

70 80 90 100 110 120 

I I I I I I 

DRQTGWMLL RLFGQEDDAL EIPMDEAAQT LAAKFKDAAD lADWAQRQVA YAVEKGYVKG 

130 140 150 160 170 180 

I I I I I I 

YPDGTFLPNA DLNGLAFCSL ILQQLGYDGD FVFDEAAYKL QEFGGLTAEQ AEAFNNKNGI 

190 200 210 220 230 240 | 

I I I I I I 

NRDSMVGIAF SALQAVYKAT GKTVIEVLVE NGNVSKELAI ELGVLLKAIK EVKALDAVKV 

250 260 270 280 290 300 

I I I I I I 

QVGKEPVLPE EVEWYEDDT TEKLAVEWPT VDTSEVGEQE lEGTIKGASG LAYREPKATL 

310 320 330 340 350 360 

I I I I I I ! 

KVIVTPEELQ WDVKAPNLK EIVIEFNGEV ASKADEKSSY SVEDNTIELV TVSEDKTTVT 

370 380 390 400 410 420 

I I I I I I 

LTVAGAMTAE EEIEVTIKTA TGLKEEVTKT WPADYENPE AESIALIGPN SFEIKFSEPV 

430 440 450 460 470 480 

I I I I I I 

QSSSDAEVLV NDGTYYVSEE KLSQDYRTLT VELGVSSLNE GTYKVKVKGY RDYAGNIMRT 

490 500 510 520 530 540 

I I I I I I 

KTFDLEYVKD TTPPTAKVKE ATQNKWIEF NEPATRDGYS GDEAALTRDY FYQTYSSWKP 

550 560 570 580 590 600 

I I I I I I 

TKWASDNNK VYTLYFSEDQ NDGGYPVYLL PVGNVTITIL KEVDDDAWD AWGNKLESDL 

610 620 630 640 650 660 

I I I I I I 

KLTATVAADN EAPTVKSVTA EAEDKIVWP SEDVNENQAK DKDNYVIKKD GKEIDTAISS 

670 680 690 700 710 720 

I I I I I I 

ITYDSNETKV TIVLDEKLSG GKYTIDIKGI KDTSVSENEM KAVTIEFEVT DKTAPTIEEV 

730 740 750 760 770 780 

I I I I I I 

TFVDNYIYVR YSEAMSTKGN GSVLNKDNYK LVDDNDKKVE IKKIELFGSD KNKVRITVDS 

790 800 810 820 830 840 

I I I I I I 

pVDLNVDYEL TIANVEDEAG NAISAFDVKA KKLSEEQAPE VSEIRIISKT EIEIVINKIL 

850 860 870 880 890 900 

> I I I I I 

DKATVEKTDF EVERGSNKVA LTRISSITYD DGKTIVKGVL PDAVRPANSG DITGYTLYIV 

1910 920 930 940 950 960 

I I I I I I 

GEIKSDTGKE MATGAVSKPV DDKFAPSFVS VANGVYGDAS KKGFTLTFDE DIKFLNNSAG 
970 980 990 1000 1010 1020 
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1030 
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• http://www.cbi.ac.uk/cgi-bin/expasyfetchPAy004256 

ID AY004256 Standard; DNA; PRO; 2434 BP. 

XX 

AC AY004256; 
XX 

SV AY004256.1 

XX 

DT 03-DEC-2000 (Rel . 66, Created) 

DT lO-APR-2001 (Rel. 67, Last updated. Version 3) 

XX 

DE Clostridium difficile SlpA (slpA) gene, complete cds . 

XX 

KW 

XX 

OS Clostridium difficile 

OC Bacteria; Firmicutes; Clostridia; Clostridiales; Clostridiaceae; 

OC Clostridium. 

XX 

RN [1] 

RP 1-2434 

RX MEDLINE; 21189268 . 

RX PUBMED; 11292772 . 

RA Karjalainen T.K., Waligora-Dupriet A.J., Cerquetti M., Spigaglia P., 

RA Mauri P., Mastrantonio P.; 

RT "Molecular and genomic analysis of two genes encoding surf ace -anchored 

RT proteins from Clostridium difficile"; 

RL Infect. Immun. 69 (5) :3442-3446 (2001) . 
XX 

RN [2] 

RP 1-2434 

RA Karjalainen T.K.; 
RT 

RL Submitted (05- JUL-2000) to the EMBL/GenBank/DDBJ databases. 

RL Microbiology, Univ Paris-Sud, Faculty of Pharmacy, 5, rue JB Clement, 

RL Chatenay-Malabry 92296, France 

XX 

DR SPTREMBL; Q9EY85 ; Q9EY85 . 
XX 

FH Key Location/Qualifiers 
FH 

FT source 1. .2434 

FT / db_xr e f = " t axon : 1496 " 

FT /mol_type= "genomic DNA" 

FT /organism= "Clostridium difficile" 

FT /strain="79685" 

FT CDS 56. .2209 

FT /codon_start=l 

FT / db_xr e f = " S PTREMBL : Q9EY85 " 

FT /note="S layer protein" 

FT /transl_table=ll 

FT /gene="slpA" 

FT /product="SlpA" 

FT /protein_id=" AAF89093 . 1 " 

FT /translation="MSGLTVLASAAPVFAADVKAEYITVQKDYKDTLKKIQAGIKDGSI 

FT TNLWTYDKDKEVANYNYKSDATTADAKEIAATTLYNLVDSKLDNLGDGDLVSFNIKYD 

FT AAEKFHTKDEMDALKTKLENKEIVKPASETTAGLVMADGATDSKKADKSLYAKDVIKFD 

FT WSDTIGYKLTATPIADAQLATLKATYKYANNTKVEFASATELAATDGSAVEVAKGKEY 

^T NATGSLVFDSATGKTSNINVDPLTNKGDTWKVINAKESTIDIDSSTSTSAEDLAKKYV 

FT FDEDKLDDIYKELTSEEGYGNLVQLVSGRYQVALYPEGKRLDTKGATDIENTPVKLVLK 

FT ADKIKDMKDYIDDLRTYNNSYSNWTVAGEDRIETAIELSNKYYNSDDKHAITDSATDS 

FT WLVGSQAIVDGLVASPLASEKHAPLLLTSKDKLDSNVKSEIKRVMDLKSTSGINTSKK 

FT VYLAGGVNSISKEVENELKDMGLKVTRLSGDDRYETSLAIADEVGLDNDKAFWGGTGL 

FT ADAMSIAPVASQLKKSNGDLDWDGDATPIVWDGKAKTINNETEDFLNNAQVDIIGGE 

FT NSVSKDVEKSIWATGKEPNRTSGDDRQATNAEVMKETDYFEKGSVINYFVAKDGSTKE 

FT DQLVDALAAAPVAANFGSTYDGKNANGTVSPAPIVLATDSLSADQNVGVSKSVSDDGGK 

FT NLVQVGKGIASSVISKMKDLLDM " 
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XX 

SQ Sequence 2434 BP; 952 A; 287 C; 448 G; 747 T; 0 other; 

attatattac aaccctcctt aaattcttta cttattcttt ttatatcgtt atcgtatgtc 60 

aggtttaaca gtgttggctt cggctgctcc tgtattcgct gcagatgtaa aagctgagta 120 

tataactgtt caaaaagatt acaaagatac tcttaaaaaa atacaagcag gtatcaaaga 180 

tgggtcaata actaatttag ttgtgactta tgataaagac aaggaagttg caaattataa 240 

ttacaaaagt gatgcaacta cagcagatgc taaagaaata gctgctacta ctttatataa 300 

tctagttgat tcaaaattag ataatttagg tgatggagat ttagtatcat ttaatataaa 360 

atatgatgct gctgaaaaat tccatactaa agatgagatg gatgcactta aaactaaatt 420 

agagaataaa gagattgtta agccagcatc agaaactaca gctggtctag taatggctga 480 

tggtgctact gattcaaaaa aagcagataa aagtttatat gctaaagatg taattaagtt 540 

tgatgtagta tcagatacta taggatataa gcttacagca acaccaattg ctgatgcaca 600 

attagcaact ttaaaagcta cttataaata tgcaaataat acaaaggtag aatttgctag 660 

tgctacagag cttgctgcta cagatggttc agctgtagaa gttgcaaaag gaaaagaata 720 

taatgcgact ggctcacttg tatttgatag tgctactgga aagacttcta atataaatgt 780 

tgatcctctt actaacaagg gtgatacagt tgtaaaagtt ataaatgcaa aagaatcaac 840 

aatagatata gattcaagta ctagtacaag tgctgaagat ttagctaaga aatatgtatt 900 

tgatgaagat aaattagatg atatatataa agagttaact agtgaagagg gatatggaaa 960 

tttagtacaa ctagtaagtg gtagatatca agtagctctt tatccagaag gaaagagatt 1020 

agatactaaa ggagcaacag atatagaaaa tactcctgtt aaattagttc ttaaagcaga 1080 

taagataaaa gatatgaaag attatataga tgatttaaga acatataaca acagttattc 1140 

aaatgttgta actgtagctg gagaagatag aatagaaaca gctatagaat taagtaataa 1200 

atattataat tctgatgata agcatgctat aactgattca gcaactgatt cagtagtttt 1260 

agttggttct caagctatag ttgatggtct tgttgcatca cctttagcat cagaaaaaca 1320 

tgctccatta ttattaactt caaaagataa actagattca aatgttaaat ctgagataaa 1380 

aagagttatg gatttaaaat ctacaagtgg tataaatact tctaaaaaag tttatttagc 1440 

tggtggagtt aactctatat ctaaagaagt tgaaaatgaa ttaaaagata tgggacttaa 1500 

agttacaaga ttatctggtg atgacagata tgaaacttct ttagctatag ctgatgaagt 1560 

aggtcttgat aatgataaag catttgtagt tggtggaact ggattagcag atgctatgag 162 0 

tatagctcca gttgcttctc aattaaagaa atctaatgga gatttagatg tagttgatgg 1680 

agatgctact ccaatagtag ttgtagatgg aaaagctaaa actataaata atgaaacaga 1740 

agatttctta aacaatgcac aagttgatat aataggtgga gaaaacagtg tatctaaaga 1800 

tgtagaaaaa tctatagttg ttgctactgg aaaagagcct aacagaacta gtggagatga i860 

tagacaagct actaatgcag aagtaatgaa agaaactgat tacttcgaaa aaggaagtgt 192 0 

aataaactac ttcgtagcaa aagatggttc tactaaagaa gatcaattag tagatgcttt 1980 

agcagcagct ccagttgcag ctaacttcgg ttctacttat gatggtaaaa acgctaatgg 2040 

aactgtttct ccagctccaa tagtattagc tactgatagc ttatctgcag accaaaatgt 2100 

aggtgtaagt aaatcagtta gtgatgatgg tggaaagaac ttagttcaag ttggtaaagg 2160 

tatcgctagt tcagtaataa gcaaaatgaa agatttatta gatatgtaat ataagtttta 22 2 0 

ataaaacttt aaatataaaa gacttctctt aagagaagac tttttttaca ataaaaaaat 2280 

aaatgcaaaa taaaatagag gctatttata gcctctattt tatgtgagaa atacctaaat 2340 

aaaaagatgt ttaatttatt taagtaaaat cattatacta taaatcaaaa taattgtcta 2400 

ctattatgga tttttaatgt ataatagtaa ttgg 2434 
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LI: Entry 15 of 17 



File: USPT 



Nov 14, 1995 



DOCUMENT-IDENTTFrER: US 5466672 A 

TITLE: Therapeutic use of Clostridium difficile toxin A 

CLAIMS: 

5. A method of treating cancer, comprising: 

a) providing i) a patient with gastrointestinal cancer, and ii) a preparation comprising Clostridium difficile toxin 
A in an aqueous solution in therapeutic amounts that is intravenously injectable; and 

b) intravenously injecting said toxin into said patient. 
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LI: Entry 9 of 17 



File: USPT 



Jun 27, 2000 



DOCUMENT-rDENTIFIER: US 6080725 A 

TITLE: Immunostimulating and vaccine compositions employing saponin analog adjuvants and uses thereof 
CLAIMS: 



12. The vaccine of claim 11, wherein said bacterial antigens are antigens associated with a bacterium selected 
from the group consisting of Helicobacter pylori, Chlamydia pneumoniae. Chlamydia trachomatis, Ureaplasma 
urealyticum, Mycoplasma pneumoniae. Staphylococcus spp.. Staphylococcus aureus. Streptococcus spp.. 
Streptococcus pyogenes, Streptococcus pneumoniae, Streptococcus viridans, Enterococcus faecalis. Neisseria 
meningitidis. Neisseria gonorrhoeae, Bacillus anihrads, Salmonella spp., Salmonella typhi, Vibrio cholera, 
Pasteurella pestis, Pseudomonas aeruginosa, Campylobacter spp., Campylobacter jejuni, Clostridium spp., 
Clostridium difficile, Mycobacterium spp., Mycobacterium tuberculosis, Treponema spp., Borrelia spp., Borrelia 
burgdorferi, Leptospria spp.. Hemophilus ducreyi, Corynebacterium diphtheria, Bordetella pertussis, ' 
Bordetella parapertussis, Bordetella bronchiseptica, hemophilus influenza, Escherichia coli. Shigella spp., 
Eriichia spp., Rickettsia spp. and combinations thereof. 
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File: USPT 



Jul 17, 2001 



DOCUMENT-IDENTIFIER: US 6262029 Bl 

TITLE: Chemically modified saponins and the use thereof as adjuvants 
CLAIMS: 



29. The vaccine of claim 28, wherein said bacterial antigens are antigens ossociated with a bacterium selected 
from the group consisting of Helicobacter pylori. Chlamydia pneumoniae, Chlamydia trachomatis, Ureaplasma 
urealyticum. Mycoplasma pneumoniae, Staphylococcus spp., Staphylococcus aureus, Streptococcus spp., 
Streptococcus pyogenes, Streptococcus pneumoniae, Streptococcus viridans, Enterococcus faecalis, Neisseria 
meningitidis. Neisseria gonorrhoeae. Bacillus anthracis, Salmonella spp., Salmonella typhi, Vibrio cholera, 
Pasteurella pestis, Pseudomonas aeruginosa, Campylobacter spp., Campylobacter jejuni, Clostridium spp., 
Clostridium difficile. Mycobacterium spp., Mycobacterium tuberculosis, Treponema spp., Borrelia spp., Borrelia 
burgdorferi, Leptospria spp.. Hemophilus ducreyi, Corynebacterium diphtheria, Bordetella pertussis, 
Bordetella parapertussis, Bordetella bronchiseptica, hemophilus influenza, Escherichia coli, Shigella spp., 
Eriichia spp., Rickettsia spp. and combinations thereof. 
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Entry name Q8KHB6 
Primary accession number Q8KHB6 


Secondary accession numbers 


None 


Entered in TrEMBL in 


Release 22, October 2002 


Sequence was last modified in 


Release 22, October 2002 


Annotations were last modified in 


Release 22, October 2002 


Name and origin of the protein 1 


Protein name 


5- layer protein variable domain SlpA 
[Fragment] 


Synonyms 


None 


Gene name 


SLPA 


From 


Clostridium difficile fTaxID: 1496] 


Taxonomy 


Bacteria: Firmicutes: Clostridia: 


Clostridiales: Clostridiaceae: Clostridium. 


References 
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SEQUENCE FROM NUCLEIC ACID. 
STRAIN=ATCC 43597, 90-111, and 93-136; 

MEDLINE=22083941; PubMed=12089261: FNCBI. ExPASy. EBI. Israel. 
Japan 1 

Karialainen T.. Saumier N.. Bare M.C.. Delmee M.. Coilionon A.: 


"Clostridium difficile Senotyping Based on slpA Variable Region in S-Layer 
Gene Sequence: an Alternative to Serotyping."; 
J. Clin. Microbiol. 40:2452-2458C2002) 


Comments | 



None 



Cross-references 
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Sequence Information 



Length: 389 A A 
[This Is the length 
of the partial 
sequence] 



Molecular weight: 
41788 Da [This is the 
MW of the partial 
sequence] 



CRC64: C5ED8F4901C18F8C 

[This is a checksum on the 
sequence] 
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Entry name 


Q8KHH9 


Primary accession number 

• 


Q8KHH9 


Secondary accession numbers 


None 


Entered in TrEMBL in 


Release 22, October 2002 


Sequence was last modified in 


Release 22, October 2002 


^Annotations were last modified in 


Release 22, October 2002 


Name and origin of the protein 


Protein name 


5-layer protein voriable domain SlpA 
[Fragment] 


Synonyms 


None 


Gene name 


SLPA 


From 


Clostridium difficile FTaxID: 14961 


Taxonomy 


Bacteria: Firmicutes: Clostridia; 






Clostridiales; Clostridiaceae; Clostridium. 


References 


[1] 


SEQUENCE FROM NUCLEIC ACID. 
STRAIN=56026, 54823, and ATCC 43601; 

MEDLINE=22083941; PubMed=12089261; FNCBI. ExPASy. EBI. Israel. 
Japanl 

Karialainen T.. Saumier N.. Bare M.C.. Delmee M.. Collionon A.: 




"Clostridium difficile 6enotyping Based on slpA Variable Region in S-Layer 
Gene Sequence: an Alternative to Serotyping."; 
J. Clin. Microbiol. 40:2452-2458(2002). 


Comments 



None 



Cross - references 
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Entry information 


Entry name 


Q8KHI4 


Primary acczss\or\ number 


Q8KHI4 


Secondary accession numbers 


None 


Entered in TrEMBL in 


Release 22, October 2002 


Sequence was last modified in 


Release 22, October 2002 


Annotations were last modified in 


Release 22, October 2002 


Name and origin of the protein 


Protein name 


S-layer protein [Fragment] 


Synonyms 


None 


Sene name 


SLPA 


From 


Clostridium difficile fToxIh: 149ft] 


Taxonomy 


Bacteria; Firmicutes: Clostridia: 




Clostridiales; Clostridiaceae; Clostridium. 



lof 4 



8/12/03 9:10 AM 



NiccProt Vtew of TrEMBL: Q8KHI4 



http://us.cxpasy.or9/cgi-bin/niccprot.pi7Q8KHr4 



References 
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STRAIN=ATCC 43598, (5AI95600, and GAI95601; 
MEDLINE=22083941; PubMed=12089261; FNCBI . ExPASy . EBI . Israel . 
Japan ! 



Kar ialainen T. . Saumier N. . Bare M.C. . Delmee M. . Collionon A. : 



"Clostridium difficile Genotyping Based on slpA Variable Region in S-Layer 
Gene Sequence: an Alternative to Serotyping."; 
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of the partial 
sequence] 


Molecular weight: 
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MW of the partial 
sequence] 
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sequence] 
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